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In order that this bulletin 

may reflect in as complete and timely manner as possible 

the initiatives and activities carried out 

by each and all the countries of the region 

in relation to the Major Project of 

Education in Latin America and the Caribbean, 

pertinent official bodies are invited 

to send to 

the UNESCO Regional Office for Education in 
Latin America and the Caribbean 
all information they wish 
to have published in this bulletin. 



The views expressed in the signed articles are those of their authors, 
and are not necessarily shared by UNESCO. 

Reprint of this publication is authorized mentioning its source. 



The publication of Ms bulletin has been made possible by the voluntary contribution of the 
Spanish Government to the activities of the Major Project of Education 
in Latin America and the Caribbean. 
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Presentation 



Strategies to improve the quality of education are taking on 
multidimensional qualities. Consensus has broadened appreciably on 
the need to deal with this challenge at a curricular, institutional, 
administrative and teaching level. This issue of the Bulletin contains a 
series of contributions on this theme. 

First, we present a study by Robin Horn, Laurence Wolff and Eduardo 
Velez that summarizes the techniques which have proven most successful 
in implementing and using educational evaluation systems, and that 
suggests some future courses of actions in this field. It is based on the 
premise that information obtained through educational evaluation is so 
positive for the system that it is possible to reap enormous benefits if it is 
linked to a broader and more comprehensive programme for enhancing 
quality. 

This provides the framework for the report ff Programmes for improving 
quality in elementary schools in popular sectors” , in which the Ministry 
of Education of Chile gives an account of its recent efforts in this area. 
The programme is based on the concept of (f positive discrimination” . It 
effectively demonstrates the potential of measurement systems to 
adequately target compensatory actions for which the central 
administration is responsible. 

Indigenous groups are among the most important target populations of 
the Major Project. Bilingual intercultural education is a strategy that 
has proven significantly productive in this respect, from a curricular and 
methodological standpoint. The article entitled Bilingual education 
beyond national frontiers, by Luis Enrique Lopez and Lucia D’ Emilio, 
presents the case of the positive horizontal cooperation between two 
countries in the region, working to implement an intercultural bilingual 
education programme for Aymara- and Quechua-speaking peasant 
populations. 

The article by Maria de Ibarrola analyzes the new tasks that lie ahead 
for technical and professional education at high school level in the 
region. She recalls the extensive and costly experience that exists on the 
subject, and stresses the urgency of recouping it. This researcher states 
that high school technical education institutions require a new 
educational administration which clearly pinpoints the actors 
responsible for bringing about innovation and change, and which 
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pursues the building of institutional levels that guarantee the quality of 
new technological training. 

Olabisi Kuboni presents an analysis of the relation between education 
and work from the standpoint of functional illiteracy among youths and 
adults in Trinidad and Tobago. His article sets forth the need to increase 
the target population’ s capacity to integrate with its socio-economic 
environment. At the same time he mentions and discusses some specific 
skills. 

As usual, we also include the section on OREALC Activities. 




EDUCATIONAL ASSESSMENT SYSTEMS IN 

LATIN AMERICA. A REVIEW OF ISSUES AND RECENT 

EXPERIENCE 



Robin Horn, Laurence Wolff and Eduardo Velez* 



The main objective of this review is to inform education decision-makers in Latin 
America of the range and complexity of the issues involved in developing and imple- 
menting national educational assessment systems in this region. This review summa- 
rizes best practice in implementing and utilizing educational assessments, reviews the 
experience of several Latin American countries in implementing national assessment 
systems, and comes up with a set of concrete suggestions for the future of educational 
assessment systems for Latin American countries. These suggestions are designed to 
help ensure that educational assessments can in fact influence policy makers and 
teachers to improve the quality of education. This review is timely because many 
countries in Latin America and the Caribbean are currently embarking on programs to 
assess educational achievement in primary education. Given their economic problems 
and resulting constraints on public finances, these countries need to direct their 
resources toward cost effective investments, programs, and interventions in education. 
Specifically, two countries, Chile and Costa Rica, have conducted national assessments 
for the primary level. Honduras, the Dominican Republic, Brazil, Jamaica, Colombia 
and Mexico have undertaken or are planning a variety of somewhat more limited 
assessments of student learning. These countries, as well as Belize, Ecuador, Northeast 
Brazil and the state ofSao Paulo, are considering establishing larger, more systematic 
assessment systems. 



Definitions 

National assessments of student learning meas- 
ure the educational performance of a nation’s or 
region’s students and evaluate the progress of 
the schools, school districts, municipalities, or 
states in achieving curricular or other goals of 
the education system. National assessment sys- 
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terns differ from national certification examina- 
tion systems. The objective of national certifi- 
cation examinations is to identify students who 
have successfully completed a certain level of 
education or, alternatively, to select completers 
for entry into the next higher educational level 
(e.g., from primary to secondary). To meet their 
objective, national examination systems must 
use the examination instrument to evaluate all 
students seeking a completion certificate or 
seeking selection into the next higher level of 
education, for which there may be a restricted 
number of openings. 

Testing for selection occurs in many Latin 
American countries at the end of the secondary 
cycle, as a prerequisite for entrance to univer- 
sities. These testing and selection examinations 
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generally are organized in such as way so as to 
provide as little feedback as possible to students 
and teachers, beyond giving them a grade. This 
is in part because the designers of such tests are 
especially concerned with security and test ad- 
ministration. Generally , Latin America countries 
do not have national or regional tests at the 
primary level for purposes of certification or 
selection to secondary education. The exceptions 
include Mexico, the Anglophone Caribbean 
countries, and the Bogota metropolitan area of 
Colombia (for entrance into public secondary 
schools in Bogota). In most of Latin America 
individual teachers make judgments regarding 
whether children will pass or fail at all levels of 
primary education. 

If tests are already being administered for 
purposes of certification or selection, it may be 
possible to redesign or restructure the instruments 
and the overall design in such a way as to serve 
some of the same objectives of an assessment 
system. In fact, the first step in considering 
whether to set up an assessment system in a 
country is to determine whether existing certi- 
fication and selection tests can be adapted to 
serve assessment objectives. Piggy-backing on 
an existing testing program may be a cost- 
effective alternative to building a parallel as- 
sessment system, and should not be over- looked 
in any strategy for achieving national educational 
goals. 

It is also important to differentiate between 
educational assessments, educational statistics, 
and educational research, all of which provide 
information of potential use to policy makers 
and practitioners. Typical educational statistics 
systems often cover such elements as enrollments 
by grade and age, numbers of student grade 
repetitions, estimates of student flow, teacher 
characteristics, physical facilities, equipment 
and materials , and educational costs. The advent 
of the personal computer means that more data 
can be gathered within a shorter period of time, 
and fairly sophisticated analyses can be under- 
taken to guide policy makers. Latin American 
countries, as is the case with most developing 
countries, are only now beginning to go beyond 



the rudimentary phase of educational statistical 
analysis. 

Educational research is defined as the rigor- 
ous testing of hypotheses about causal relation- 
ships. In particular, much educational research 
attempts to measure the determinants of student 
learning and achievement. State of the art 
educational research requires complex sampling 
designs, sophisticated analytical expertise, and 
as well as sophisticated computer modeling tools. 
Educational research, especially when under- 
taken on a longitudinal framework, can be quite 
expensive. In principle the aim of educational 
assessments is not to support the testing of 
hypotheses, although such hypothesis testing 
can well be incorporated into them. In practice, 
as discussed below, the data developed by an 
assessments so rich thatsignificantbenefits can 
be achieved if these data are linked with a 
comprehensive research program. 

How Educational Assessments Can Be Used 

Obviously , by itself, measuring student learning 
does not yield increased achievement any more 
than weighing grain will yield increased 
agricultural output. It is, however, a necessary 
condition to establishing quantitative targets, 
assessing the tradeoffs of alternative allocation 
strategies, input combinations, and production 
technologies, and allocating resources and ef- 
fort to achieve established targets. The potential 
uses of educational assessment are described 
below. Many of the uses of assessments can be 
found in the United States, which has the longest 
history and experience with assessments. 

Using Assessments to Garner Public Support 
for Increased Efforts to Improve Educational 
Quality 

An educational assessment system that meas- 
ures student learning and reports on how well 
student learning matches clearly stated and 
understood performance standards helps the 
public understand what students know, and al- 
lows them to track progress. Policy-makers can 
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set clear targets, and the public can monitor 
progress toward these targets. In particular, if 
the system is found to be deficient, then addi- 
tional funding and changed emphases can be 
justified. 

If assessment learning targets are set too high, 
then the public may become disillusioned when 
reports indicate that very few students or schools 
(or only particular groups of student or schools) 
are achieving the targets. This dimension sug- 
gests careful planning. Involving a broad 
spectrum of well respected educationalists and 
leaders in the goal setting exercises, as well as in 
the assessment policy board, and eliciting the 
participation of the press throughout the process, 
will help assure the public that the national 
assessment system and the national educational 
goals are developed in a fair and open process. It 
also suggests that the audience(s) for the as- 
sessment reports should be clearly identified 
from the start, even before the assessment in- 
struments are designed, to assure that the kinds 
of questions likely to be raised by each audience 
can be answered by the assessment. 

Using the Backwash Effects of an Assessment 

An indirect way in which assessments can affect 
teaching and learning is through their “backwash” 
effects. Backwash effects is the term used to 
describe the effects that an examination has on 
what teachers teach and what students make the 
effort to learn. “Teaching to the test” exempl ifies 
backwash effects. Backwash effects are always 
present in a school system that relies on an 
examination to certify completion of one level 
of education or to determine selection into a 
subsequent level of schooling (i.e., secondary to 
post-secondary), or into a particular training 
program (nursing or mechanical training). In 
many instances, the backwash effects of a 
critically important selection examination reach 
back through many years of schooling, such as 
a university entrance examination that effects 
teaching from upper secondary down through 
primary school. If the examination is designed 
to measure the entire curricular domain, with a 



variety of academic and nonacademic 
applications, and higher order thinking skills, 
then it would make sense to capitalize on the 
backwash effects of the examination. 

Assessments, as discussed above, are not usu- 
ally combined with selection. Therefore, a 
sample based assessment system whose out- 
comes do not yield consequences for the students 
taking the assessment tests, may not produce 
any backwash effects per se. Nonetheless, the 
assessment system may be designed to encour- 
age changes in the behavior of students or teachers 
similar to those yielded by backwash effects. 
First, if students, teachers, and parents take 
educational examinations seriously, then as- 
sessments may well exert powerful influence on 
what is taught and what is learned. By designing 
assessment instruments to examine the skills 
and knowledge areas that all students should 
acquire, rather than only those skills relevant for 
students likely to advance into the next level of 
the system, the schools will increasingly find 
themselves providing instruction that benefits 
all children. Second, assessments may be de- 
signed to provide rewards, of financial or other 
resources, or of prestige which may produce 
backwash-like effects. Finally, countries may 
administerassessment instruments toall students 
in all schools and then seek to combine assess- 
ment and certification objectives in a single 
system, measuring the achievement of national 
educational targets while simultaneously as- 
suring backwash effects. 

A variation on the backwash effects of testing 
would be “minimum competency” testing. For 
example, standards in the state of Alabama were 
set indicating the minimal level of performance 
on the Alabama assessment required of gradu- 
ating secondary school students in each subject 
area. The minimal standards targets were not 
established to punish poorly performing students. 
Rather, they were designed to help district officers 
and school principals monitor the progress of 
Alabama students and to make sure that they 
achieve a reasonable standard of education. 
Research in Alabama shows that students at all 
ability levels, even advanced students, regis- 
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tered marked improvements in assessment per- 
formance. 

Diagnosing and Treating Learning Problems 
and Improving InstructionalDesign andTeacher 
Training 

Another way in which assessment systems can 
be designed to improve educational practice is 
by providing content diagnosis and specific 
feedback from the assessment system to teach- 
ers, parents, students, and the community. By 
supplementing the feedback system with an 
analysis of the types of errors students make and 
with suggestions on techniques for improved 
teaching to help students avoid these types of 
errors, the educational assessment system can 
have a direct and positive influence on student 
learning. 

This was done with the Kenya examination 
reform implemented in the late 1970s, with 
successful results (Somerset, 1987)-a classic 
case of utilizing a selection system to meet 
assessment- type objectives. First the examina- 
tions were revised to assess a broader range of 
skills and higher order thinking skills. Then 
these new instruments replaced the older ex- 
aminations which were mainly measures of 
students’ rote recall. Subsequently, individual 
examination items were carefully analyzed to 
determine the types of skills in which Kenyan 
students were weak and appeared to consistently 
lack comprehension. For instance, by means of 
an analysis of the examination items, it was 
discovered that while students were measuring 
up wel 1 in the area of formal mathematics theory, 
they were not capable of applying their math 
skills to everyday situations. It was suggested 
that the curriculum and teachers were being too 
theoretical and were devoting too little time to 
practical applications. In one issue of the 
newsletter regularly distributed to the teachers 
as part of the examination reform program, 
teachers were advised of the problem, were 
given examples of the types of errors students 
were making, and were told of more effective 
means of teaching the necessary skills. Annex 1 



providesan example of the feedback from Kenya. 
The Kenya feedback system has been criticized 
as helping students to answer examination 
questions better rather than improving learning. 
However, to the extent that examination ques- 
tions measure fundamental higher level skills 
this is not a valid criticism. 

Similarly educational assessment systems can 
be used to provide information to educators that 
can help them improve instructional design and 
classroom pedagogy. Recently, a number of 
states in the USA have developed in their as- 
sessment system a n umber of indicators designed 
to identify and accurately measure performance 
in a number of narrowly defined subject area 
domains. In this way, educators can tell if 
particular practices, teacher training approaches, 
or curricular materials are effective. Based oi} 
this approach, for example, a school system can 
redesign its in-service teacher training system in 
a way that the assessment analysis indicates is 
most effective. 

Rewarding Good Performance 

Assessments can also be used to reward good 
performance or help those with bad perform- 
ance to improve. Results from assessments have 
been used to hold regions, municipalities, dis- 
tricts, or even schools and school leaders (given 
the appropriate sample) accountable for per- 
formance in a number of states and districts in 
the USA. Student assessments cannot be used to 
hold teachers accountable for good teaching 
because it is not unlikely that differences in the 
quality of students taught by teachers being 
evaluated can be statistically controlled (Glass, 
1974). For example, in Boston and South Caro- 
lina, districts showing significant improvement 
in assessment scores (taking initial conditions 
and school social indicators into account) are 
rewarded with additional finances; those dis- 
tricts that continue to perform below a certain 
standard are sanctioned by transferring their 
authority to run their schools to an outside team 
of experts. Another example was Michigan’s 
1973 Chapter III plan (for educationally 
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disadvantaged students). The plan was designed 
to initially allocate funds on the basis of schools 
with high numbers of poorly performing students, 
and then to allocate funds on the basis of student 
progress from baseline levels. 

One of the problems associated with systems 
of rewards and punishments is that teachers, 
principals, and districts may have greater in- 
centives to cheat- to record or report higher scores 
that those which wereor would have been actually 
earned. They can justify cheating if they believe 
it will help them obtain resources for their stu- 
dents or if they believe that the system is unfair. 
Obviously, test security would have to be seri- 
ously considered when implementing this type 
of plan. Another problem associated with using 
accountability as a mechanism to raise school 
quality is that it would require all the nation’s 
schools to be included in the sample (unless 
district-level accountability is sufficient). 
Moreover, the use of the assessment to assign 
rewards or to dole out punishments would lend 
the system a legitimacy for which legislators or 
other policy-makers may not be initially prepa- 
red. These burdens suggest that funding deci- 
sions-certainly on a school by school basis- 
should not be a part of a new assessment system, 
and should only be carefully introduced under 
any circumstances. 

Using National Assessments in Educational 
Research and Development 

As noted above the primary aim of educational 
assessments is to measure progress in meeting 
the curriculum goals of the educational system- 
not to support the testing of research hypotheses. 
However, considering the richness of the out- 
come measures gathered by an assessment, the 
marginal effort needed to generate and test re- 
search hypotheses suggests that all assessments 
should include a research component. 

It is true that assessments do not and cannot 
effectively build in all of the statistical controls 
necessary to assure that good research can be 
conducted (especially as these controls are 
constantly being discovered or refined). None- 
theless, national assessment efforts produce an 



immensely powerful set of outcome measures. 
To the extent that assessments also provide an 
array of control and background information on 
students and family characteristics, teacher 
characteristics, and classroom factors, along with 
information on costs, it will be possible for 
researchers to assess the effectiveness of alter- 
native inputs into the schooling process, and 
identify cost-effective solutions to improving 
student performance. Moreover, a key compo- 
nent of such a research agenda is analysis that 
attempts to find links between educational 
achievement and labor market productivity or 
other social outcomes. This type of research 
would al low policymakers to assess the economic 
rates of return associated with different measures 
of educational skills acquisition. However, much 
of this type of analysis requires fairly advanced 
research capacity. A fundamental strategy in the 
building of an assessment system should there- 
fore be the funding of a research capability that 
may include skilled and experienced subject 
matter specialists, educational analysts, statisti- 
cians, psychometricians, econometricians, 
sampling experts, and computer programmers. 
However, a country that lacks a state of the art 
educational research apparatus need not be 
constrained to start one without a sophisticated 
team of highly advanced researchers and re- 
search tools since adequate and useful research 
can be done without the use of the most sophis- 
ticated techniques. Nonetheless, the more com- 
plex the questions being asked of the data, the 
more advanced the research methodology requi- 
red to answer them accurately, and the more 
sophisticated the research system will need to 
be. 

In short, while assessment systems are not 
specifically designed as pure research instru- 
ments, they could be designed in such a way as 
to collect data that can be used by the research 
community to conduct basic research on edu- 
cation, to analyze the effectiveness of various 
educational inputs, and to estimate cost-effective 
ways of increasing student learning. Adequate 
funding should be built into the system to ensure 
that such research is carried out. 




11 



BULLETIN 27, April 1992 / The Major Project of Education 



Technical and Management Issues in 
Educational Assessments 

The following are the main issues and decisions 
which will need to be taken in the course of 
implementing educational assessments. 

Defining Objectives 

The objectives of educational assessments de- 
termine what should be measured and how it 
should be measured. These objectives will 
therefore need to be clearly articulated from the 
start. As noted above assessment objectives 
could include: monitoring and reporting on the 
nation’s progress towards established perform- 
ance targets; providing quantitative ammuni- 
tion to encourage or to direct districts or schools 
to improve performance; furnishing educators 
with data and research for the purpose of 
diagnosing and treating learning problems or of 
changing instructional design and teacher 
training; and developing materials thatencourage 
teachers to improve their instructional content 
and pedagogical practices. A decision will be 
required on what levels and subjects would be 
tested. A realistic five-year implementation 
plan will need to be prepared on the basis of the 
objectives and modalities selected. A 
dissemination and follow up plan should be an 
integral part of the design. 

The Nature of the Testing Agency 

A decision will need to be taken on the location 
and nature of the testing agency. One alternative 
is to locate all the testing expertise in a 
government agency. This may be necessary if 
private institutions do not exist. Another alter- 
native is to keep the central assessment agency 
in Government very lean, with a few high qual- 
ity staff, and to contract out the major work of 
undertaking an assessment to a non-profit au- 
tonomous institution. This second approach will 
help to ensure flexibility and high quality of 
technical personnel. Usually an independent 
assessment policy board overseeing the entire 



assessment process is established, as well as 
subject councils and advisory committees, one 
for each subject area or academic discipline, to 
elucidate specificexamination objectives within 
each area. 

Determining the Type of Tests to be Given 

Achievement tests are examinations keyed to 
measuring the extent to which children learn the 
intended curriculum. Aptitude tests measure 
students’ “innate” abilities. Achievement tests 
therefore would normally be used to serve the 
objectives of an educational assessment. 
Achievement tests have greater predictive power 
for future educational levels than aptitude tests. 
“Objective” tests, such as multiple-choice or 
short-answer test formats, are mqre cost-effec- 
tive, reliable, amenable to statistical analysis, 
and easier and quicker to score than perform- 
ance tests, such as long essay examinations or 
skill demonstrations. However, performance- 
type items help to ensure the teaching and learning 
of writing, original thinking, analysis, and syn- 
thesis, and should be included if financially and 
technologically feasible. Criterion referenced 
tests measure whether particular or prescribed 
standards are met. Norm referenced tests com- 
pare and rank student learning with average 
(mean) achievement levels. Criterion referenced 
tests are normally the preferred choice for as- 
sessments since they are designed to examine 
students’ mastery of the system’s educational 
objectives. The performance standards of crite- 
rion referenced examinations will need to be 
made explicit and widely reported and under- 
stood. 

Preparing and Administering the Test 

The preparation of a detailed examination 
syllabus for each subject area and grade-level to 
be assessed is the first step in instrument devel- 
opment. Objectives listed in the examination 
syllabus are normally distributed across all of 
the content, process, and context areas judged to 
be of fundamental importance. How much em- 
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phasis to place on recall of factual information 
and terminology, and comprehension of basic 
concepts and ideas, versus higher-order thinking 
skills, problem analysis, and the application of 
knowledge and skills will need to be decided. 
Table 1 providesanexampleoflevelsof thinking 
skills. Test items will need to be generated by 
trained item writers to correspond to the content 
and process areas identified in the examination 
syllabus. Assessment instruments will also 
normally measure a variety of student, classroom, 
school, and community input and cost items 
determined on the basis of the assessment 
objectives and the feasibility of gathering such 
information. Draft assessment booklets with 
accompanying answer blanks will need to be 
prepared in multiple forms containing the 
background questionnaire, test items, and 
administration instructions. Strict quality con- 
trols and test security controls should oversee 
the entire preparation process. A rigorous full- 
fieldpilot testconducted in a representative sub- 
sample of classrooms is needed to iron out 
difficulties in the examination booklets and ad- 
ministration procedures, and to provide guid- 
ance for preparing the final versions of the 
examination booklets. A decision will need to 
be taken with regard to whether to test a sample 
or a universe of students. If the assessment is not 
also part of an effort at certification or selection, 
and is not designed to provide feedback to every 
teacher about his/her class, then a scientific 
sample of students would achieve the objective 
of improving learning at a fraction of the cost 
and effort of a full assessment of the student 
population. However, if the sampling is not done 
rigorously, then the assessment may provide 
inaccurate and confusing results. For an assess- 
ment based on a sample, a stratified cluster 
sample is normally most appropriate. Areas likely 
to be less responsive than average would need to 
be over-sampled and private schools could be 
included for comparison purposes. Multiple 
choice items will need to be scored mechani- 
cally and essay items read and scored by 
professional readers backed up by supervisors 
and a reliability system. Items will need to be 



analyzed forqualityanddiscriminability, initially 
on the basis of percentages and means, and later, 
depending on the testing agency ’ s capacity, using 
more sophisticated techniques. 

Disseminating and Following upon Test Results 

What information to report, to whom it should 
be reported, and how to report it is the first step 
in the design of a national assessment. Reports 
of results will need to be bold, clear, direct, 
concise, short, journalistic, balanced, graphi- 
cally illustrated, non-technical, tailored to 
specific audiences, disseminated to the smallest 
level consistent with the sample, and presented 
by background characteristics. For maximum 
impact reports and results would need to be 
incorporated into pre-and in-service teacher 
training and other programs and into guidelines 
for teaching and supervision. 

Determining Costs 

Table 2 provides very rough estimates of the 
relative costs of sampling compared to uni- 
verses and “objective” compared to performance 
tests. Depending on the modality selected and 
the size of the universe, the costs for, say, a 
primary level test in Spanish and mathematics 
could range from as little as US$100 000 to as 
much as US$40 million. The least expensive 
system is one with “objective” tests directed at a 
small scientifically selected sample. The most 
expensive system is a “performance” based 
system directed at the universe of students. 

With few exceptions, the costs of undertaking 
even a large scale assessment are always a very 
small fraction (less than 0.1%) of the total bill 
for the educational system, of which mostof the 
costs are for salaries. However, these costs are 
discretionary, with uncertain and sometimes 
politically sensitive results. 

It is quite important to prepare realistic cost 
estimates from the start for a five to ten year 
program and to try to get a commitment from 
central government authorities to adequately 
finance the assessment system over that period 
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Table 1 



A TAXONOMY OF EXAMINATION QUESTIONS 



Question 


Level of Thinking 




Examples 


Category 


Required from Student 






1. Recall 








Who? Whatl? 


Recall of facts, observations 


1. 


What is the capital of Chile? 


Where? When? 


or definitions. 


2. 


When did Paraguay become 
independent? 


2. Comprehension 








Describe, compare 


Giving descriptions, stating 


1. 


Describe in your own 


contrast, explain, 


main ideas, comparing, and 




words the theme of 


rephrase, translate. 


contrasting. 


2. 


a story. 

Compare life in urban and 
rural areas of the country. 






3. 


interpret a graph. 


3. Application 








Apply, solve, 


Applying appropriate rules and 


1. 


Juan and Marta can do a 


classify, select. 


techniques to solve problems; 




piece of work in 15 days. They 




recognize principles in new 




work together for six days. 




situation. 




Then Marta leaves and Juan 
finishes the work in a further 
30 days. In how many days can 
Juan do the piece of work alone? 


4. Analysis 








Identifying causes 


Making inferences, finding 


1. 


What is Garcia Marquez’ 


or reasons, drawing 


evidence to support 




views of society in his novel 


conclusions, inferences 


conclusions. 




One Hundred Years of 


deductions. 




2. 


Solitude? 

Following your 
experiments, what is your 
conclusion about the 
factors affecting the growth 
of seedlings? 


5. Synthesis 








Predict, propose, 


Solving problems, making 


1. 


What would happen if 


plan, write. 


predictions. Producing original 




there was drought? 




communications. 


2. 


What actions could the 
government take to 
control population 
growth? 


6. Evaluation 








Judge, evaluate, 


Judging the value of ideas, of a 


1. 


Do you think that 


decide, appraise. 


solution to a problem, of the 




newspapers influence public 




merit of art and literature. 


2. 


opinion? 

Should the government take 
action to control population 
growth? 



Source: The World Bank, Examination Systems in Africa , 1991, adapted from Perrott (1982). 
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